Integration of Tone Related Features for Mandarin Speech Recognition by a One-pass Search Algorithm

Authors

  • Pui-Fung WONG
  • Man-Hung SIU
Abstract

How to model Chinese tones and integrate them into an HMM-based recognizer for Chinese recognition has been an area of interest to researchers. In this paper, we propose the use of a polynomial trajectory model to represent pitch shape. We further propose an efficient one-pass search approach that integrates the tone likelihood into the Viterbi search procedure. We report a number of experimental results on tone classification and tonal syllable recognition in the 863 corpus. While the improvement in the tonal syllable accuracy is small, it nevertheless shows the feasibility of the proposed approaches.
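As a rough illustration of the two ideas in the abstract, the sketch below fits a low-order polynomial to a syllable's F0 contour and adds a weighted tone log-likelihood to an acoustic log-score. It is not the authors' implementation: the function names, the i.i.d. Gaussian residual model, the polynomial order, and the `tone_weight` parameter are all assumptions made for the example, and the abstract does not specify how the paper scores or weights the tone model inside the Viterbi search.

```python
import numpy as np

def fit_pitch_trajectory(f0, order=2):
    """Least-squares polynomial fit of an F0 contour over normalized time [0, 1]."""
    f0 = np.asarray(f0, dtype=float)
    t = np.linspace(0.0, 1.0, len(f0))
    coeffs = np.polyfit(t, f0, order)  # highest-order coefficient first
    residual = f0 - np.polyval(coeffs, t)
    return coeffs, residual

def tone_log_likelihood(f0, tone_coeffs, sigma):
    """Log-likelihood of f0 given a tone's mean trajectory.

    Assumption: frame residuals are i.i.d. Gaussian with std `sigma`.
    """
    f0 = np.asarray(f0, dtype=float)
    t = np.linspace(0.0, 1.0, len(f0))
    mean = np.polyval(tone_coeffs, t)
    n = len(f0)
    sq_err = np.sum((f0 - mean) ** 2)
    return float(-0.5 * n * np.log(2.0 * np.pi * sigma ** 2) - 0.5 * sq_err / sigma ** 2)

def combined_score(acoustic_logp, tone_logp, tone_weight=0.5):
    """Hypothetical log-linear combination used when extending a search path."""
    return acoustic_logp + tone_weight * tone_logp

# Synthetic rising contour (Hz) scored against a rising and a falling tone model.
contour = 100.0 + 50.0 * np.linspace(0.0, 1.0, 20)
rising = np.array([50.0, 100.0])    # f0(t) = 50 t + 100
falling = np.array([-50.0, 150.0])  # f0(t) = -50 t + 150
score_rising = tone_log_likelihood(contour, rising, sigma=10.0)
score_falling = tone_log_likelihood(contour, falling, sigma=10.0)
```

In a one-pass decoder, a score like `combined_score` would be applied at syllable boundaries so that tone evidence influences pruning during the search rather than in a later rescoring pass.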


Similar resources

Phonetic state tied-mixture tone modeling for large vocabulary continuous Mandarin speech recognition

This paper presents a new approach to tone modeling for continuous Mandarin speech recognition. Mandarin tones provide rich information for speech recognition. In this paper, we treat the tone as an attribute of the final vowel part of a Mandarin syllable. Separate distributions are estimated for cepstral coefficients and pitch features respectively, and the phonetic state tied-mixture techniqu...


Incorporating Pitch Features for Tone Modeling in Automatic Recognition of Mandarin Chinese

Tone plays a fundamental role in Mandarin Chinese, as it plays a lexical role in determining the meanings of words in spoken Mandarin. For example, these two sentences R R (I like horses) and R M (I like to scold) differ only in the tone carried by the last syllable. Thus, the inclusion of tone-related information through analysis of pitch data should improve the performance of automatic speech...


Improved Mandarin Speech Recognition by Lattice Rescoring with Enhanced Tone Models

Tone plays an important lexical role in spoken tonal languages like Mandarin Chinese. In this paper we propose a two-pass search strategy for improving tonal syllable recognition performance. In the first pass, instantaneous F0 information is employed along with corresponding cepstral information in a 2-stream HMM based decoding. The F0 stream, which incorporates both discrete voiced/unvoiced i...


Features of stimulation affecting tonal-speech perception: implications for cochlear prostheses.

Tone languages differ from English in that the pitch pattern of a single-syllable word conveys lexical meaning. In the present study, dependence of tonal-speech perception on features of the stimulation was examined using an acoustic simulation of a CIS-type speech-processing strategy for cochlear prostheses. Contributions of spectral features of the speech signals were assessed by varying the ...


Prosodic modeling for improved speech recognition and understanding

The general goal of this thesis is to model the prosodic aspects of speech to improve human-computer dialogue systems. Towards this goal, we investigate a variety of ways of utilizing prosodic information to enhance speech recognition and understanding performance, and address some issues and difficulties in modeling speech prosody during this process. We explore prosodic modeling in two languag...




Publication date: 2003